PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG021750t1
Common Name TCM_021750
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family MYB
Protein Properties Length: 363 aa    MW: 41409.5 Da    PI: 9.3093
Description MYB family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG021750t1  genome  CGD     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  27.4   7.7e-09  82     127  3          46
                       SS-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHHH CS
   Myb_DNA-binding   3 rWTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwqk 46 
                       +WT+eE + + +a +++ ++   +W  +a++++ g+t ++++  ++ 
  Thecc1EG021750t1  82 KWTPEENKCFENALALYDKDtpdRWFMVAAMIP-GKTVEDVIKQYRE 127
                       8********************************.*******999986 PP

2    Myb_DNA-binding  46.9   6.5e-15  190    234  3          47
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
   Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                       +WT+eE+ +++ + k++G+g+W+ I+r + ++Rt+ q+ s+ qky
  Thecc1EG021750t1 190 PWTEEEHRQFLMGLKKYGKGDWRNISRNFVTTRTPTQVASHAQKY 234
                       8*******************************************9 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS51293           9.168     78     133  IPR017884    SANT domain
SMART            SM00717           5.1E-10   79     131  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          1.66E-11  81     135  IPR009057    Homeodomain-like
Pfam             PF00249           3.4E-7    82     127  IPR001005    SANT/Myb domain
CDD              cd00167           1.01E-7   82     128  No hit       No description
Gene3D           G3DSA:1.10.10.60  4.7E-13   181    240  IPR009057    Homeodomain-like
PROSITE profile  PS51294           20.486    183    239  IPR017930    Myb domain
SuperFamily      SSF46689          1.93E-17  185    238  IPR009057    Homeodomain-like
TIGRFAMs         TIGR01557         1.6E-17   186    238  IPR006447    Myb domain, plants
SMART            SM00717           2.1E-13   187    237  IPR001005    SANT/Myb domain
Pfam             PF00249           2.5E-12   190    234  IPR001005    SANT/Myb domain
CDD              cd00167           7.26E-11  190    235  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 363 aa
MASHFSAFFY LGFSLSLNIS FKEGEKQRSS SFFFFLYGQI NCWQLHNYKK KREMMNRGLE  60
VLSPASYLQT SNWLFQESRG TKWTPEENKC FENALALYDK DTPDRWFMVA AMIPGKTVED  120
VIKQYRELEE DVSDIEAGLI PIPGYSSDSF TLEWVNDSQG FDGFRQYYTP GGKRGAGTRP  180
SDQERKKGVP WTEEEHRQFL MGLKKYGKGD WRNISRNFVT TRTPTQVASH AQKYFIRQLN  240
GGKDKRRSSI HDITTINVPD TPSSSPDHSK PLSPNNSAAV MQAQQQPKVA GVTKELLEWK  300
QQNEGAAMIF NQTSGNAFLS PFCGISSYGP KVDEQNFLRG TLPRSQFGSY NTLFQMQSMQ  360
RQ*
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
2cjj_A  1e-16    80           147        8          75       RADIALIS
Binding Motif
Motif ID  Method  Source                   Motif file
MP00296   DAP     Transfer from AT2G38090  Download
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JN127772  0.0      JN127772.1 Theobroma cacao clone TCC_BC076K20, complete sequence.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_007026779.1      0.0      Duplicated homeodomain-like superfamily protein
Swissprot  Q8S9H7              1e-116   DIV_ANTMA; Transcription factor DIVARICATA
TrEMBL     A0A061ER13          0.0      A0A061ER13_THECC; Duplicated homeodomain-like superfamily protein
STRING     POPTR_0006s09860.1  1e-174   (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1228              27           100
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G38090.1  1e-137   MYB family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]